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m Abstract 

o 

Compressed sensing is a promising technique in signal processing that attempts to recover sparse 

signal through as few linear and nonadaptive measurements as possible. In this sense, the recovery 

performance is largely determined by the structure of measurement matrix. Recently, {0, 1} binary random 

matrix is found attractive for its low computation load as well as for its comparable performance with 

Gaussian Matrix. However, the optimal distribution of binary random matrix for compressed sensing 

remain unknown in theory. This paper, for the first time, deterministically defines one class of near- 
l-H 

optimal binary random matrix by comparing the restricted isometry property (RIP) of regular binary 

O 
i_i random matrix characterized with the correlation between columns. In theory, the proposed matrix is 

,— I expected to achieve nearly the best RIP with as few nonzero entries as possible. Simulation results also 

> 

^__l confirm the availability of the proposed near-optimal matrix, with better performance over both traditional 

binary random matrix and Gaussian matrix. 

Index Terms 

o 
en 

i-H binary random matrix, compressed sensing, correlation, near-optimal, restricted isometry property, 

> RIP. 



I. Introduction 

Suppose that a /c-sparse signal x € R N , with at most k nonzero entries, is measured by an undetermined 

matrix A C R MxN with M < N 

V = Ax, (1) 
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compressed sensing HI states that one can recover x perfectly from finite observations y G R A/ . This 
problem is customarily formulated as an Zi-minimization based convex optimization problem 

min||x||i subject to y = Ax (2) 

provided that restricted isometry property (RIP) is satisfied. To understand RIP, we first review the concept 
fc -restricted isometry constant (RIC) 6k, which is the smallest quantity obeying 

(1 -5 k )\\x T \\ 2 < \\A T x T \\ 2 < (1 + ^)||x T || 2 (3) 

for all subsets T C {1, ...,iV} with cardinality \T\ < k and all vectors xt £ R' T ', where At denotes the 
set of columns of A with indices in T. And then RIP of order k asserts that the A;-sparse signal can be 
recovered faithfully by Eq. (1), if 62k is less than some threshold, i.e., 0.4652 in [2). Roughly speaking, 
RIP requires that the symmetric matrix A' t At approximates isometry. 

It is well known that some random matrices generated by certain probabilistic processes, like Gaussian 
or Bernoulli processes d 0, satisfy RIP with high probability. In terms of the works related to Johnson- 
Lindenstrauss lemma H Q, these dense matrices are allowed to be sparsely sampled without obvious 
performance loss in compressed sensing. However, due to the randomness in structure and the uncertainty 
on RIP, these random matrices are prohibited in real applications. Consequently, several deterministic 
sensing matrices based on some special codes, like Reed-Solomon codes [6] [7], Reed-Muller codes [8], 
chirp sensing codes J9l etc., are sequentially proposed. In spite of explicit structure, these matrices still 
suffer from uncertain RIP and unstable performance ifTOl . 

Recently, the sparse binary parity-check matrix defined by LDPC codes [11] (shortly called 'LDPC 
matrix' in this paper) attracts our attention for its competitive performance as well as circuit-friendly 
structure [12J. Up to now, the structure and construction for better LDPC matrix in coding theory have 
been widely studied. In contrast, the literature for better performance in compressed sensing is rare. In 
fact, as one class of {0, 1} binary sparse matrix, LDPC matrix has no distinguishable characteristic in 
structure with its original definition iTTTll . except for its particular representation and analysis methods 
based on bipartite graph. In this sense, the research for better LDPC matrix is essentially equivalent to 
searching more feasible sparse binary matrix with better performance while lower sparsity. However, to 
the best of our knowledge, there seems no impressive work to address this problem. In the earlier works 
|[T3l |[T4l ifTBI ifToll . LDPC matrix is only simply used as a common sparse binary random matrix [17], 
without particular evaluations on structure, i.e., girth or correlation between columns. Recently Dimkis 
et al. [18] propose that the good LDPC matrix in channel coding is probably the good sensing matrix 



in compressed sensing, by building a mathematical connection between channel coding and compressed 
sensing with linear programming decoding. Furthermore, they state that the good matrix should have 
large girth. Unfortunately, these theoretical results are still rough from the viewpoint of application. 
Most recently, Li et al. iflOl evaluate one class of LDPC matrix based on Berlekamp-Justesen codes, 
and obtain deterministic RIP and comparable performance with Gaussian matrix. Obviously, all the 
aforementioned works are far away from the ideal goal of defining the better or even best sparse binary 
matrix in compressed sensing. 

In this paper, by exploring the connection between the correlation and the girth of LDPC matrix, for the 
first time, we theoretically define one class of near-optimal {0, 1} binary random matrix for compressed 
sensing, which achieves nearly the best RIP with nearly fewest nonzero entries. Significantly, in practice 
this kind of matrix can be approximately determined and constructed with progressive edge-growth (PEG) 
algorithm |[T9l . Numerical simulations also show that, the proposed near-optimal matrix indeed exists 
with better performance over both traditional binary random matrix and Gaussian matrix. In the following 
study, to distinguish with previous studies on binary matrix, we prefer to use 'LDPC matrix' to refer to 
the sparse binary random matrix characterized with the tool: bipartite graph, popularly used for LDPC 
codes. 

Moreover, it is necessary to mention that the accurate calculation of RIP has been regarded as a 
difficult or even impossible assignment J20l . For instance, recent works [21] [22] have proved that the 
RIP solution to a given matrix with a given order k is NP-hard. However, the computational obstacle 
can be penetrated by analysis ll23l . To search best binary random matrix, this paper will approximate the 
RIP of random matrix by analyzing its distribution. As will be detailed later, the solution to RIP can be 
derived by seeking the extreme singular values of At IJ231 . Unfortunately, the accurate or comparable 
extreme singular values of random matrix, cannot be well approximated with current random matrix 
theory ll24ll . Recall that the paper focuses on the comparison between RIPs rather than accurate RIP. 
Thus, in this paper the solution to the extreme singular values of At is roughly relaxed to calculating the 
extreme eigenvalues of symmetric matrix of similar distribution with A' t At- In other words, A' t At is 
simply regarded as symmetric rather than positive semidefinitive. As it is known, the extreme eigenvalues 
of symmetric matrix has been widely studied in random matrix theory, and accurate solutions can be 
derived or approximated on some constraints. 

The rest of the paper is organized as follows. In next section, the regular binary random matrix is 
defined by two types of LDPC matrices of different girth distributions. Relative concepts about Tanner 
graph and girth are also introduced. In section III, the RIPs of above two types of LDPC matrices are firstly 



calculated and analyzed in Theorems 1-3, and then the near-optimal binary random matrix are further 
derived by the comparison of RIPs in Theorem 4. In section IV, the empirical performance of proposed 
near-optimal matrix constructed with PEG algorithm is verified, by comparing with LDPC matrix with 
different sparsity as well as other popular matrices. Finally, we end this paper with a conclusion in section 
V. 

II. Fundamentals of LDPC Matrix 

In the study of LPDC codes, LDPC matrix tends to be represented by Tanner Graph (in Definition 1), 
and analyzed with girth (in Definition 2, abbreviated as g). According to the distribution of girth, this 
paper, for the first time, categorizes regular binary random matrix without same columns, into two types 
of LDPC matrices: LDPC matrix with girth g > 4 (in Definition 3) and LPDC matrix with girth g = 4 
(in Definition 4). This novel definition will be helpful for the following computation of RIP Currently, 
LDPC matrix with g > 4 can be constructed with numerous algorithms thanks to wide studies on LDPC 
codes. In contrast, it seems that there is no off-the-shelf knowledge to the construction and analysis of 
LDPC matrix with g = 4 due to its bad performance in LDPC codes. So this paper cannot sufficiently 
describe its structure in the following study. Typically, to express the variation of the correlation between 
columns with degree d varying, where d denotes the number of nonzero entries in each column, the 
columns of LDPC matrix are normalized, and thus nonzero entries are set to \j\fd instead of 1. 

Definition 1 (Tanner Graph): A bipartite graph holds N variable nodes and M check nodes, 
respectively corresponding to N columns and M rows of binary matrix {0,l} MxN . Variable nodes 
and check nodes are connected by the nonzero entries of binary matrix. 

Definition 2 (Girth): In Tanner graph, the girth is defined as the length of the shortest cycle through 
all variable nodes. 

Definition 3 ( LDPC Matrix with girth g > A): A binary matrix A(M,N,d) C {0, l/V~d} MxN , 
consists of 2 < d < M — 2 nonzero entries per column and Nd/M nonzero entries per row. In structure, 
any two columns are allowed to share at most one same nonzero position. In adjacent Tanner graph, the 
girth is larger than 4. 

Definition 4 ( LDPC Matrix with girth g = A): A binary matrix A(M, N, d, s) C {0, l/V~d} MxN , 
consists of 3 < d < M — 2 nonzero entries per column and Nd/M nonzero entries per row. The largest 
correlation value between two distinct columns is s/d with 2 < s < d— 1. In adjacent Tanner graph, the 
girth is equal to 4. 



For the performance of sensing matrix, the correlation between distinct columns has been a fundamental 
index. Fortunately, the correlation of LDPC matrix with g > 4 can be statistically determined as shown in 
Lemma 1. Conversely, as the former states, it is hard to detail the correlation distribution of LDPC matrix 
with g = 4, due to limited knowledge. But one can roughly conjecture that its nonzero correlation values 
are taken from the set {1/d, ...,s/d}. Therefore, with the rough relation between the largest correlation 
and compressed sensing ll25ll 

fc<i(i + i/ M ) (4) 

where p := max^da^a-,!} = 1/d or s/d for above two types of LDPC matrices, it is reasonable to 
expect that the larger d probably defines the better LDPC matrix. Also, this paper is developed with this 
inspiration. 

Lemma 1 (Correlation of LDPC matrix with g > 4): Any two distinct columns of LDPC matrix 
A(M, N, d) in Definition 3 take on correlation values 



/ 



1/d with probability p = f£f^£ 
with probability 1 — p 



where aj and aj denote two distinct columns of A(M, N, d). 

Proof: In Tanner graph associated with A(M,N,d), any variable node Vi, i G {1,...,JV}, holds d 
neighboring check nodes c bk , where the subscript bk G C C {1, .. ., M} denotes the index of check node, 
k G {1, ..., d}, \C\ = d; each check node c bh further connect with other jj — 1 variable nodes Vj, where 
j G Vb k C {1, ...,iV} \ i represents the index of variable node, |V& k | = ^4 — 1. Since variable node 
Vi has girth> 4 in Definition 1, we have that V6 e |~)VL = 0, where e, / G {1, ...,&} and e / /, then 
derive | V bl \J V b2 [j ... \J V bk \ = d X (^ - 1). Therefore, among N - 1 variable nodes, there are Nd ^ Md 
variable nodes connecting with variable node vi through one check node. Equivalently, any column of 
A(M, N, d) has Nd ^ Md correlated columns with correlation value 1/d. Then formula (5) is proved. ■ 

III. LDPC MATRIX FOR COMPRESSED SENSING 

In this section, the RIPs for LDPC matrices with g > 4 and g = 4 are first derived and analyzed in 
Theorems 1-3, and then the near-optimal LDPC matrix is proposed and discussed by comparing RIP in 
Theorem 4. 
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Fig. 1. (a) Maximum, mean and minimum of the proportion of nonzero entries in the off-diagonal of A' t At, \T\ = k, for 
LDPC matrix A (400, 200, 7). The mean is equal to the theoretical value p — 0.2281 in Lemma 1. (b) The probability that the 
proportion p of nonzero entries in the off-diagonal of A' t At centers on p with error bound \p — p\/p. Each point of (a) and 
(b) is derived with 10 3 simulations. 



A. RIP of LDPC matrix with girth larger than 4 

With the definition of LDPC matrix with g > 4, it is easy to derive that A' T A T C {0, 1, l/d} kxk 
with T € {1, ...,-AT} and \T\ = &R is a positive semi-definite symmetric matrix with the diagonal equal 
to 1, and the off-diagonal of the distribution illustrated in Lemma 1. Theoretically, the solution to the 



'Suffering from some possible confusion with the former notation in RIC, we simply assume \T\ = k in the following part. 



RIC-<5fc of A' t At, can be transformed to the pursuit for the extreme eigenvalues of A' t At (equivalently, 
the square of the extreme singular values of At), since 

x'^ALAtxt 

l-S k <X k < T T T T < Al < ! + ^ (6) 

x t xt 
where Ai and A^ represent the two extreme eigenvalues of A' t At- Note that, in this paper Ai > A2... > A& 
are customarily used to denote the order of eigenvalues of A' t At- 

As the introduction states, it is hard to directly derive the extreme singular values of At, and so 
we turn to calculate the extreme eigenvalues of random symmetric matrix of similar distribution with 
A' t At- In this paper, we prefer a simple algebra algorithm ll26l rather than other popular results based on 
semicircle law E71l . for the solution of extreme eigenvalues. The major advantage of the former is that it 
can provide an accurate solution for random symmetric matrix, if the random matrix could achieve some 
special distribution. In contrast, the accuracy of the latter can only be expected if the matrix size k is 
sufficiently large. Obviously, this is unfavorable for RIP usually with small k. In the following Theorem 
1, the RIP-1 of LDPC matrix with g > 4 is derived with J26). 

Theorem 1 (RIP-1): LDPC matrix A(M, N, d) in Definition 1 satisfies RIP with RIC 

3fc-2 

** = U + k-2 (7) 

With S 2 k < 0.4652 0, further derive k < 0.3671d + 0.2110, for the faithful recovery based on h- 
minimization. 

Proof: Please see Appendix A. ■ 

Note that, as the proof detailed in Appendix A, the two extreme eigenvalues for RIP- 1 are achieved on 
the assumption that the proportion p of nonzero entries in the off-diagonal of A' t At, could take value 
1 or 0.5, for any \T\. However, as Lemma 1 discloses, this condition cannot be satisfied all the time, 
because the proportion p should center on p < 1 with higher probability as \T\ increases. For better 
understanding, we further give an example in Figure 1, in which the proportion p for LDPC matrix 
^4(200, 400, 7) fast converges to the theoretical value p = 0.2281 < 0.5, as \T\ increases. In this case, for 
the large LDPC matrix with RIP of order k large enough such that p = p, its RIP-2 can be asymptotically 
evaluated with semicircle law [27], as shown in Theorem 2. Note that, Theorem 2 is only feasible for 
LDPC matrix with RIP of order k large enough such that the condition for semicircle law, \T\ — > 00, 
could be well approximated. Thus, considering generality and accuracy, RIP-1 rather than RIP-2 is used 
in the following comparison between RIPs. 



Theorem 2 (RIP-2): Assume that the off-diagonal elements of A' t At take nonzero values with 
probability p = yjffM, for any \T\(\T\ - \)p > 2, then drive that, the RlC-5 k of A(N,M,d) can be 
approximately formulated as 

fcp - 2^(1 - p) + 3 
if k = \T\ -> oo. 

Proof: Please see Appendix B. ■ 

B. RIP of LDPC matrix with girth equal to 4 

For LDPC matrix A(N, M, d, s) with g = 4 and the largest correlation p = s/d, the off-diagonal of 
A' t At possibly takes values from the set {0, 1/d, ..., s/d}, where 3 < d < M — 2 and 2 < s < d — 1. 
With the same solution algorithm Ii26l to Theorem 1, the RIP-3 of LDPC matrix with g = 4 is derived 
in Theorem 3. 

Theorem 3 (RIP-3): LDPC matrix A(M, N, d) with g = 4 and n = s/d , where 2 < s < d - 1 and 
3 < d < M - 2 satisfies RIP with RIC 

nr^tr%^ if 3<d<^f and 2<s<d-l 

Proof: Please see Appendix C. ■ 

Note that, like RIP-1, the two extreme eigenvalues for RIP-3 are also achieved as the off-diagonal 
elements of A' t At can take the maximal nonzero value s/d with probability 1, or take binary value 
{0, s/d} with equal probability. Nevertheless, as the section II states, in practice it remains unknown 
whether the above distributions could be well satisfied merely with limited knowledge about LDPC 
matrix with g = 4. And so this paper cannot ensures that the extreme eigenvalues for RIP-3 possibly are 
surely achieved by the practical matrix. 

C. Our contributions: Near-optimal binary random matrix 

This section attempts to theoretically define one class of near-optimal binary random matrix in Theorem 
4, by comparing the RIPs of LDPC matrices with different sparsities/degrees. 

Theorem 4 (Near-optimal binary random matrix): The largest degree d max of LDPC matrix 
A(N, M, d max ) with g > 4, defines the near-optimal binary random matrix for compressed sensing, 
which obtains nearly the best RIP ( or least RlC-5k) with as few nonzero entries as possible. 



Proof: The near-optimal LDPC matrix A(N,M,d rnax ) is derived by comparing itself with other 
LDPC matrix A(N, M,d < d max ) with g > 4 and LDPC matrix with g = 4. 

1) If d < d max , 

• compared to LDPC matrix with g > 4: 

with RIP-1 or RIP-2, it is easy to derive that the RIC-<5fc decreases as the degree d increases. 
Thus, LDPC matrix with g > 4 achieves best RIP as d = d max . 

• compared to LDPC matrix with g = 4: 

by comparing RIP-1 and RIP-3, it is easy to derive that 4 ]^ 2 < J^T 2 ^ , if2<s<d-l 
and 3 < d < M/2 (note that, according to 1 + d(j[f - 1) < N (FB, one can approximately 
derive d max < \J~M < M/2). This indicates that the RIC-4 of LDPC with g > 4 is less than 
that of LDPC matrix with g = 4. Thus, LDPC matrix with g > 4 and d ma:r still holds better 
RIP, if d < d max . 

2) If d > d max , 

• compared to LDPC matrix with g = 4: 

by comparing RIP-1 and RIP-3, LDPC matrix (M,N,d max ) with g = 4 holds better RIP 
in the following two cases: first, if d ma:r < d < M/2 , let M ^~+ k _ 2 < n^^g , derive 

(3fc-2)s+(fc-2)(M-2d) 

(fc-2)s-(M-2d)fc+2M ' 

6s+2(2d-M) 



that d ma;E > d/s; 5eCO nd, if M/2 < d < M - 2, let 4d ^- 2 fc _ 2 < l^z ^-'.^."-"-^ 
approximately derive that d max . > ^ ^ — = — ^ 



There are three major factors rendering the near-optimal rather than optimal property of Theorem 4. 
First, as stated in the proof, when d > d max , the near-optimal matrix A(M, N, d max ) is derived under two 
constraints: d max > d/s and d max > 6s+2 f2d- M ) • ^ or tne ^ rst constraint, it is reasonable to conjecture 
that the better LDPC matrix with g = 4 and d > sd max , dose not exist, since it is contrast to the fact 
that one can derive at most d = d max when s = 1, in terms of the definition d max for LDPC matrix with 
g > 4 in Theorem 4. And for the second constraint, unfortunately it is hard to determine whether the ideal 
d for better LDPC matrix with g = 4 exists or not with the complex formula. Nevertheless, note that, 
the second constraint is derived from d > M/2, which is beyond our practical interest for sparse matrix. 
Second, it is necessary to recall that both RIP-1 and RIP-3 are derived on some special assumptions, 
which cannot be well satisfied as \T\ increases. Specially, as the former states, for LDPC matrix with 
5 = 4, the practical RIP may be better than RIP-3, since empirically it is hard for the practical matrix to 
obey the assumption well as \T\ increases. Third, it should be noted that this paper only investigates the 
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regular LDPC matrix, which is expected to share same number of nonzero entries between columns or 
between rows. In fact, Theorem 4 could also be roughly extended to irregular LDPC matrix with g > 4, 
by proving that the larger average degree d implies better RIP. In practice, compared to the proposed 
near-optimal LDPC matrix, irregular LDPC matrix usually can achieve a better RIP thanks to its larger 
average degree lfi"2l . 

IV. Simulation results 

A. Simulation setup 

To verify Theorem 4, this section first studies the performance of LDPC matrix over varying degrees 
d. Considering the sparsity of input signal is usually uncertain and the noises are also inevitable in real 
applications, LDPC matrix is also evaluated under above two cases. For comparison, traditional binary 
random matrix of varying degrees and Gaussian matrix are also tested. 

The near-optimal binary random matrix, LDPC matrix ,4(200, 400, d max = 7) is constructed with PEG 
algorithm. As a suboptimal greedy algorithm, PEG is suitable to explore the largest d of LDPC matrix 
with g > 4. To obtain near-optimal sparsity by comparison, LDPC matrices ,4(200, 400, 1 < d < 7) 
with g > 4 and LDPC matrix ,4(200, 400, 7 < d < 100) with g = 4 are also constructed with PEG 
algorithm. Moreover, note that this paper cannot provide the examples of LDPC matrix with g = 4 and 
some given p, = s/d, where 2 < s < d— 1, since there is no construction algorithms as section II states. 
As particular examples, traditional binary random matrices with 1 < d < 100 nonzero entries randomly 
and uniformly distributed in each column, denoted as R(M, N, d) with the largest correlation \i = s/d 
and s = d — 1, are proposed for comparison. 

Existing solution algorithms to Eq. (2) can be roughly categorized to two classes: convex optimization 
algorithms (basis pursuit) [28] and greedy algorithms, like orthogonal matching algorithm (OMP) lf29l 
P0l . Since the experiments are developed based on performance comparison rather than pursuing best 
performance, OMP is used here for much faster simulations. The input sparse signal is generated from 
A^O, 1) and then normalized. All simulation results are averaged after 10 4 iterations, and both binary 
random matrix and Gaussian matrix are randomly generated each iteration. The correct recovery rates 
are measured with 1 — \\x — xj^/IMh- 

B. Near-optimal property in both performance and sparsity 
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TABLE I 
The largest sparsity level k that can be recovered with probability larger than 99%, for LDPC matrix 

A(200, 400, d) AND BINARY RANDOM MATRIX R(200, 400, d) . 
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k 


LDPC 
Random 






29 



70 

55 


75 
69 


78 

73 


80 

75 


81 

76 


83 

76 


83 

76 


81 
76 


80 
76 


79 
76 


78 
76 


78 
76 


77 
76 


75 
76 


74 
76 


48 
76 


26 
76 


2 
76 



In Table 1, the largest sparsity level k that ensures recovery rates larger than 99% are presented for 
LDPC matrix binary random matrix with varying d. At the same time, the largest k for Gaussian matrix 
of size (200, 400) is also derived as 76. Obviously, with same error tolerance, larger k indicates better 
sensing matrix. As it is expected, the near-optimal binary matrix, LDPC matrix ^4(200, 400, d max = 7) 
achieves nearly the best performance with k = 81, larger than k = 76 for both binary random matrix 
and Gaussian matrix, while slightly worse than k = 83 for LDPC matrices with d G {8, 9}. Recall that, 
since PEG algorithm attempts to reduce the overlap rates of nonzero positions of distinct columns, LDPC 
matrices with g = 4 constructed with PEG algorithm tends to take correlation value \jd with higher 
probability rather than 2/d or others, as d is slightly larger than d max . Therefore, the performance gain of 
LDPC matrix with g = 4 and rfG {8,9} over near-optimal LDPC matrix with g > 4 and d max = 7, can 
be explained by the fact that its correlation values take 1/d (< l/d max ) with much higher probability 
rather than s/d (> l/d max ), 2 < s < d — 1. In this sense, this kind of LDPC matrix can be roughly 
regarded as LDPC matrix with g > 4 and d > d max , and so the better RIP can be reasonalby conjectured 
with Theorem 4. Empirically, as Table 1 shows, the performance LDPC matrix constructed with PEG 
algorithm, begins to degrade as the difference d — d max > 2 Il2l . By this observation, one can practically 
construct better LDPC matrix than the near-optimal LDPC matrix with PEG algorithm. 



C. Performance over input signals of varying sparsity or noises 

In Figure 2, near-optimal LDPC matrix is compared with binary random matrix i?(200, 400, 7) and 
Gaussian matrix over varying sparsity levels k. Binary random matrix and Gaussian matrix present 
comparative performance, while LDPC matrix significantly outperforms them. Note that the binary 
random matrix with d = 7 has achieved its best performance as shown in Table 1. Similar results 
are also observed in Figure 3, where the normalized sparse signal of k = 40 is perturbed by Gaussian 
noise N(0,a 2 ). 
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Fig. 2. The recovery rates of input signals with varying sparsity level k, for LDPC matrix A(200, 400, 7), binary random matrix 
i?(200,400, 7) and Gaussian matrix of size (200,400). 
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Fig. 3. The recovery rates of normalized input signals with Gaussian noise iV(0, a 2 ), for LDPC matrix A(200, 400, 7), binary 
random matrix i?(200, 400, 7) and Gaussian matrix of size (200, 400). 



V. Conclusion 

This paper has theoretically defined one class of near-optimal {0,1} binary random matrix with lowest 
sparsity to achieve nearly the best RIP. And the simulation results also verify that the near-optimal binary 
matrix constructed with PEG algorithm, indeed exists with better performance over traditional binary 
random matrix as well as Gaussian matrix. In future, it is attractive to search the largest degree d max for 
near-optimal LDPC matrix in both theory and practice. 
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Appendix A 
Proof of Theorem 1 

Proof referring to [26 J: As formula (6) discloses, the solution to RIC-<5fc can be reformulated as the 
pursuit for the extreme eigenvalues of symmetric matrix A' T Ax C {0, 1, l/d} kxk , \T\ = k. Customarily, 
as the former states, we denote by Xi(A' t At) > ■ ■ ■ > \k{A' T AT) the eigenvalues of A' t At- 
1) Let B = A' T A T - I C {0, l/d} kxk , then B it = and %^j = or 1/d. 

Let normalized x = (x\, . . . ,Xk)' be the eigenvector corresponding to Xk(B). Then the minimal 
eigenvalue can be formulated as 

X k (B) = x'Bx = t'[B o (xx')]t 

where o denotes the Hadamard product and 1 = (1, . . . , 1)' E M fc . Since B is symmetric, by 
simultaneous permutations of the rows and columns of B, we can suppose xi > for i = 1, . . . , n 
and Xi < for i = n + 1, . . . , k, and then xx' is divided into four parts: 



xx 



Xr, 



x. 



nx(k-n) 



-X-(k—n)xn -^-(k-n)x(k-n) 



B 



where the entries in X nxn and -Xa._ n ) x (fe-n) are nonnegative, while the entries in X nx ^ k _ n ^ and 
Xi k _ n \ xn are nonpositive. Further, define a novel matrix B of same size with B 

x H- rax ra 2 x In.x(fc-n) 

d x l(fc-ra)xn x l(fe-ra)x(fc-ra) 
where l aX (, is an a x 6 matrix with all entries equal to 1. It is easy to deduce that 

Xk(B) = nim{y f By : \\y\\ = 1} < x'Bx < x'Bx = Xk(B). 

Since the rank of B is at most 2, it has at most two nonzero eigenvalues. Considering the trace 
and the Frobenius norm, we have 



X k (B) 



n(k — n) 



d 2 



0<n<k. 



If k is even, \ k (B) > -53, with '=' at n = k/2 
If k is odd, Afc(-B) > 



•^. with 



at n = (fc - l)/2 or n = (k + l)/2. 



Then X k (B) > X k (B) > — A, with the limitation attained at k is even and n = k/2. 
So, we have the minimum eigenvalue X k (A' T AT) > 1 — ^. 



2) Let C = ^A T 



d-l 



/, then Ca = 1/d and Cy^j = or 1/d . 
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Let normalized x = (x\, . . . ,Xk)' be the eigenvector corresponding to Ai(C). By simultaneous 
permutations of C and x, we can suppose Xi > for i = 1, . . . , n and X{ < for i = n+ 1, . . . , k, 
and the maximal eigenvalue is formulated as 



Further define 



then 



C 



Ai(C) = x'Cx = t'[Co(xx')]t. 



x l(fe— n)xn 5 x *-(k— n)x(k— n)i 



Ai(C) = maxjy'Cy : ||y|| = 1} > x'Cx > x'Cx 
= \i{C). 
Since the rank of C is at most 2, it has at most two nonzero eigenvalues. Considering the trace 



and the Frobenius norm, we have 



Ai(C-) 



k + \k- 2n\ 
2d 



Then \\{C) < X\(C) < H, with '=' at n = or n = k. And further derive 



Xi{A! T A T ) = Ai(C) + — — < 



3) Finally, we can deduce 



h 



Ai(i4^A T ) - A fc (^' T ^T) __ 3A; - 2 



Ai(A' T A T ) + A fc (^' T ^ T ) 4d + fc - 2' 



with ^fti = a El- 



Appendix B 
Proof of Theorem 2 

Proof: To derive the extreme eigenvalues of A' t At, we first search the extreme eigenvalues of 

B = {A' T A T - I) 

where I is an identity matrix. And clearly B is a symmetric matrix of the diagonal elements equal to 0, 
and the off-diagonal elements equal to 1 with property p and with property 1 — p. 
Further, suppose ll32l 
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where 1 is a all-ones matrix. Then Q has entries with mean zero and variance one. With semicircle law 
E71 , the extreme eigenvalues -jrQ, k = \T\, can be approximated as 

-2 < A(4=Q) < 2 
namely, 

-2y/kp(l ~ p) < KB ~ pi) < 2y/kp(l-p), 



if k -*■ oo 1331 . 

With cauchy interlacing inequality [34], we can further derive that 

Xi(B - pt) < Xi(B) < \i- X (B - pi) 

for 1 < i < k, if B — pt and pi are Hermitian matrices, and pi is positive semi-definite and has rank 
equal to 1. As a result, it is easy to derive that 

X 2 (B) < Ai(S - pi) < 2y/kp{l - p) 

and 

\ k {B) > X k (B - pi) > -2y/kp(l-p) 

As for Xi(B) it is known that 06l 

Ai(B) wfcp + 1 

In this sense, the extreme eigenvalues of A' t At can be approximately formulated as 

Xi{A' T A T ) = XiB + l<kp + 2 

and 

X k (A' T A T ) = X k B + 1 > -2y/kp(l-p) + 1 

Therefore, the RIC of A' t At can also be deduced 

s = Ai-Afe = fcp + 2^fcp(l-p) + l 
Ai + Afc A;p-2 v //cp(l-p)+3 

■ 

2 In 1351 . it is proved that Ai(_B) ~ fcp, as kp is sufficiently large. 
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Appendix C 
Proof of Theorem 3 

The proof is similar to that for Theorem 1 in Appendix A. So in the following we just give a sketch. 

Proof: 
1) If 3 < d < M/2, [A' T A T }ii = 1 and [A' T A T ] ijMj 6 {0, ... , s/d}, 2 < s < d-1, for i, j = 1, . . . , k. 

a) Let B = A' T A T - I, 

{—sk/2d if k is even 

—s\/k 2 — l/2d if k is odd 

then \ k {A' T A T ) = 1 + X k (B) > 1 - §§. 

b) Let C = A' T A T - (1 - § )J, derive Ai(C) < ks/d, then X^A^At) < 



(k-l)s+d 



2) if M/2 < d < M - 1, [A' T A T ]ii = 1 and [A' T A T ] ijMj e {(2d - M)/d, . . . , s/d}, 2d - M < s < 

d — 1, for i,j = l,...,k. 

k(2d-M-s) ■ t , ■ 

— — 23 */ fc zs even 

a) Let 5 = A^t-U-^^U, derive X k (B) > { 

1 k(2d-M)-J(2d-M) 2 -<k 2 -l)s 2 . „ , . 

— 23 l J k %s odd 

a \ ^ k(2d-M- 

A T ) > 2d 

b) Let C = A' T A T - (1 - |)J, derive Ai(C) < fes/d, and then it follows that Ai(^4 T ) < 



further derive A fc (E) > -MM^zf), and then we have that A fc (A' T A T ) > fc(2d-M-,)+2(M-d) 



(k-l)s+d 



3) Finally, with 5 fc = x^P^, derive 



5fc = < 



„ (3 l 2 i >S , J i/3<d<fand2<s<d-l 

(fc— 2)s+4d j — — 2 — — 
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